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SUPERANTIGENS 



TECHNICAL FIELD 
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This invention relates to superantigens, and to their use, including in diagnosis 



and/ or treatment of disease. 
BACKGROUND ART 

10 Bacterial superantigens are the most potent T cell mitogens known. They stimulate 
large numbers of T cells by directly binding to the side of the MHC class II and T cell 
Receptor (TcR) molecules. Because they override the normally exquisite MHC 
restriction phenomenon of T cell antigen recognition, they are prime candidates for 
either causing the onset of autoimmune diseases or exacerbating an existing 

15 autoimmune disorder. 

The applicants have identified genes coding for four novel superantigens from S. 
pyogenes. It is broadly to these superantigens and polynucleotides encoding them 
that the present invention is directed. 



SUMMARY OF THE INVENTION 

In one aspect the invention provides a superantigen selected from any one of SME2- 
2, SPE-G, SPE-H and SPE-J, or a functionally equivalent variant thereof. 

In a further aspect the invention provides a polynucleotide molecule comprising a 
sequence encoding a superantigen chosen from SMEZ-2, SPE-G, SPE-H, SPE-J, or a 
functionally equivalent variant thereof. 

30 In another aspect of the invention there is provided a method of subtyping 
Streptococci on the basis of superantigen genotype comprising detection of the 
presence of any or all of the above four superantigens or the corresponding 
polynucleotides. 
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In a further aspect the invention provides a construct comprising any of the above 
superantigens (or superantigen variants) bound to a cell-targeting molecule, which 
is preferably a tumour-specific antibody. 

In yet a further aspect, the invention provides a pharmaceutical composition for 
therapy or prophylaxis comprising a superantigen or superantigen variant as 
described above linked to cell targeting molecule. 

Other aspects of the invention will be apparent from the description provided below, 
10 and from the appended claims. 

DESCRIPTION OF DRAWINGS 



15 
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While the invention is broadly defined above, it further includes embodiments of 
which the following description provides examples. It will also be better understood 
with reference to the following drawings: 




Fig 1: M ^^iSDmt^^^rBntig e n protein sequences. 



The protein sequence of mature toxins were aligned using the PileUp programme on 
the GCG package. Regions of high sequence identity are in black boxes. The boxes 
below the sequences indicate the structural elements of SPE-C, as determined from 
the crystal structure (Roussel et al 1997 Nat. Struct. Biol. 4 no8:635-43). Regions 
with highest homology correspond to the P4, 05, oc4 and cc5 regions in SPE-C. The 
clear box near the C-terminus represents a primary zinc binding motif, a common 
feature of all toxins shown. The arrows on top of the sequence alignment show the 
regions of sequence diversity between SMEZ and SMEZ-2. 

Figure 2: The nucleotide sequence of the portion of the smez-2 gene (SEQ ID NO. 1) 
30 coding the mature SMEZ-2 superantigen (SEQ ID NO. 2). 

Figure 3: The nucleotide sequence of the portion of the spe-g gene (SEQ ID NO. 3) 
coding the mature SPE-G superantigen (SEQ ID NO. 4). 
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Figure 4: The nucleotide sequence of the portion of the spe-h gene (SEQ ID NO. 5) 
coding the mature SPE-H superantigen (SEQ ID NO. 6). 

Figure 5: The nucleotide sequence of the portion of the spe-j gene (SEQ ID NO. 7) 
5 coding part of the mature SPE-J superantigen (SEQ ID NO. 8). 

Figure 6: Gel electrophoresis of the purified recombinant toxins. 

A. Two micrograms of purified recombinant toxin were run on a 12 5% SDS 
0 polyacrylamide gel to show the purity of the preparations; B. Two micrograms of 
purged recombinant toxin were run on an isoelectric focusing gel (5.5% PAA pH 5- 
8). The isoelectric point (IEP) of rSMEZ-2, rSPE-G and rSPE-H is similar and was 
estimated at pH 7-8. The IEP of rSMEZ was estimated at pH 6-6.5. 

5 Figure?: Stimulation of human T cells with recombinant toxins. 

PBLs were isolated from human blood samples a*d incubated with varying 
concentrations of recombinant toxin. After 3d, 0.1 ^Ci pH]-thymidine was added 
and cells were incubated for another 24h, before harvested and counted on a 
gamma counter. O, unstimulated; A, rSMEZ; Z, rSMEZ-2; rSPE-G; R rSPE-H. 

Figure 8: Jurkat cell assay 

Jurkat cells (bearing a V P 8 TcR) and LG-2 cells were mixed with varying 
concentrations of recombinant toxin and incubated for 24h, before Sel cells were 
added. After Id, 0.1 »Ci pHJ-thymidine was added a*d cells were counted after 
mother 24h. The V P 8 targeting SEE was used as a positive control. The negative 
control was SEA Both SMEZ and SMEZ-2 were potent stimulators of Jurkat cells 
indicating their ability to specifically target V P 8 bearing T cells. O, unstimulated; 
A, rSEA; Z, rSEE; ♦, rSMEZ; ■, rSMEZ-2. 



Figure 9: Zinc dependent binding of SMEZ-2 to LG-2 cells 
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LG-2 cells were incubated in duplicates with 1 ng of laq labelled rSMEZ _ 2 ^ 
increasing amounts of unlabeled toxin at 37°C for In, and then the cells were 

washed and counted. 

S O, incubation in media; A, incubation in media plus ImM EDTA; 2, incubation in 
media plus 1 mM EDTA, 2 mM ZnCl 2 . 

Figure 10: Scatchard analysis of SMEZ-2 binding to LG-2 cells 

10 One nanogram ^I-labeled rSMEZ-2 was incubated in duplicates with LG-2 cells 
and a 2-fold dilution series of cold toxin (10 M g to 10 pg). After lh, cells were 
washed and counted. Scatchard plots were performed as described by Cunningham 
et al 1989 Science 243:1330-1336. 

15 Figure 1 1: Summary of competitive binding experiments. 

Efficiency of each labelled toxin to compete with a 10,000-fold molar excess of any 
other unlabeled toxin for binding to LG-2 cells. □ , no competition; □ 25% 
competition; B , 50% competition; H 75% competition; ■ , 100% competition. 
The results within the boxes are at the bottom right have previously been published 
(Liet al. 1997). 



0 



Figure 12: Competition binding study with SMEZ-2. 

5 LG-2 cells were incubated in duplicates with 1 ng of ^-labeled rSME2 . 2 md 
increasing amounts of unlabeled rSMEZ-2, rSEA, rSEB, rTSST or rSPE-C. After lh 
cells were washed and counted. 

O, rSMEZ-2; A, rSEA; Z, rSEB; S3 rTSST; ♦, rSPE-C. 

Figure 13: Southern blot analysis of genomic DNA with radiolabeled smez. HINDIII 
digested genomic DNA from various Steptococcus isolates was hybridized with a 
radiolabeled smez probe. Band A is a 1953 bp Hindlll DNA fragment that carries 
the smez gene. Bands B and C are DNA fragments of about 4 kbp and 4 2 kbp 
respectively, which both carry a smez like region. 1,S. pyogenes reference strain 
(ATCC 700294, Ml type); 2, isolate 9639 (MNT); 3, isolate 11789 (MNT); 4, isolate 
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11152 (PT2612 type); 5, isolate RC4063 (group C streptococcus); 6, isolate 11070 
(emm65 type); 7, DNA marker lane; 8, isolate 4202 (N25118/M92 type); 9, isolate 
94/229 (M49 type); 10, isolate 11610 (emm57 type); 11, isolate 95/127 
(NZ1437/M89 type); 12, isolate 94/330 (M4 type). 

5 

DESCRIPTION OF THE INVENTION 

The focus of the invention is the identification of four superantigens (SPE-G, SPE-H, 
SPE-J and SMEZ-2) and the corresponding polynucleo tides which encode them. 

Figure 1 shows the amino acidse^uetices of the above four superantigens together 
with thoseofj3reyio*rsly^^ superantigens SMEZ, SPE-C and SEA. 

Of the four superantigens SPE-G, SPE-H, SPE-J and SMEZ-2, the latter is perhaps 
15 of greatest interest. 



The smez-2 gene which encodes SMEZ-2 was identified in an experiment designed 
to produce recombinant SMEZ protein from S. pyogenes 2035 genomic DNA. A full 
length smez gene was isolated from the strain but the DNA sequence of the smez 
20 gene of strain 2035 showed nucleotide changes in 36 positions (or 5%) compared to 
smez from strain Ml (Fig. 1). The deduced protein sequences differed in 17 amino 
acid residues (or 8.1%). This difference establishes this as a new gene, smez-2, and 
the encoded protein as a new superantigen, SMEZ-2. 

25 The most significant difference betw^efi SMEZ and SMEZ-2 is an exchanged 
pentapeptide sequence at posjtkjfi 96- 100, where the EEPMS sequence of SMEZ is 
converted to KTSIL JriJSfrtEZ-2 (Fig. 1). A second difference is at position 111-112, 
where an R^di^eptide is exchanged for GK in SMEZ-2. The remaining 10 different 
resifiw^s are spread over almost the entire primary sequence. 
30 ^^^-^^^ 

Figure 2 shows the nu£le0ticTesequence encoding mature SMEZ-2 and the deduced 
amino acid>g6uence. 
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Likewise, Figures 3 to 5 show the nucleotide segueHt5e^encoding mature SPE-G, 
SPE-H and SPE-J superantigens^j^apre'cSvely, together with their respective 
deduced amino 

The invention is of course not re stiicted^o--etiperantIgens/ polynucleotides having 
the specific soquenc^s^oi-Fig?^ to 5. Instead, functionally equivalent variants 
axe^jeotitemplated. 



10 



15 



20 



25 



The phrase "functionally equivalent variants" recognises that it is possible to vary/ 
the amino acid /nucleotide sequence of a peptide while retaining substantially ( 
equivalent functionality. For example, a peptide can be considered a functional! 
equivalent of another peptide for a specific function if the equivalent peptide is 
immunologically cross- reactive with and has at least substantially the same 
function as the original peptide. The equivalent can be, for example, a fragment of 
the peptide, a fusion of the peptide with another peptide or carrier, or a fusion of a 
fragment which additional amino acids. For example, it is possible to substitute 
amino acids in a sequence with equivalent amino acids using conventional 
techniques- Groups of amino acids normally held to be equivalent are: 

(a) Ala, Ser, Thr, Pro, Gly; 

(b) Asn, Asp, Glu, Gin; 

(c) His, Arg, Lys; 

(d) Met, Leu, He, Val; and 

(e) Phe, Tyr, Trp. 



Equally, nucleotide sequences encoding a particular product can vary significantly 
simply due to the degeneracy of the nucleic acid code. 

Variants can have a greater or lesser degree of homology as between the variant 
30 amino acid /nucleotide sequence and the original. 



35 



Polynucleotide or polypeptide sequences maybe^iigrte^rand percentage of identical 
nucleotides in a specified region^jB^y^be determined against another sequence, 
using computer algoritjjjars^that are publicly available. Two exemplary algorithms 
for aligning ^xKTidentifying the similarity of polynucleotide sequences are the 



WO 00/39159 



PCT/NZ99/00228 



BLASTN and FASTA algorithms. The similarity of polypeptide sequgacfes may be 
examined using the BLASTP algorithm. Both the BLASTN and^BE&STP software are 
available on the NCBI anonymous FTP server (ftp^^ncbi. nlm.nih.gov) under 
/blast /executables/. The BLASTN algorithm version^To.4 [Feb-24- 1998], set to the 
5 default parameters described in the documentation of variants according to the 
present invention. The use of the BLASTxfamiiy of algorithms, including BLASTN 
and BLASTP, is described/^ at NCBI's website at URL 
http: / / www.ncbi.nlm.nih. gov /BLAST / newblast.html and in the publication of 
Altschul, Stephen F., et aL/[1997) t "Gapped BLAST and PSI-BLAST: a new 

10 generation of protein database search programs", Nucleic Acids Res. 25:3389-34023. 
The computer algorithm FASTA is available on the Internet at the ftp site 
ftp: / /ftp. Virginia. edy/pub/fasta/ . Version 2.0u4, February 1996, set to the default 
parameters descrfoed in the documentation and distributed with the algorithm, is 
also preferred/for use in the determination of variants according to the present 

15 invention. 2ne use of the FASTA algorithm is described in W. R. Pearson and D. J. 
Lipman, ^Lniproved Tools for Biological Sequence Analysis", Proc Natl Acad. Sci. USA 
85:24^4-2448 (1988) and W. R. Pearson, a Rapid and Sensitive Sequence 
Conrparison with FASTP and FASTA, u Methods in Enzymology 183:63-98 (1990). 

20 The following running parameters are preferred for determination of alignments and 
similarities using BLASTN that contribute to E values (as discussed below) and 
percentage identity: Unix running command: blastail -p blastn -d embldb -e 10 -G 1 
-E 1 -r 2 -v 50 -b 50 -I queryseq -o results; and parameter default values: 
-p Program Name [String] 

25 -d Database [String] 

-e Expectation value (E) [Real] 

-G Cost to open a gap (zero invokes default behaviour) [Integer] 
-E Cost to extend a cap (zero invokes default behaviour) [Integer] 
-r Reward for a nucleotide match (blastn only) [Integer] 
30 -v Number of one-line descriptions (V) [Integer] 
-b Number of alignments to show (B) [Integer] 
-i Query File [File In] 

-o BLAST report Output File [File Out] Optional 

For BLASTP the following running parameters are preferred: blastail -p blastp -d 
35 swissprotdb -e 10 -G 1 -E 1 -v 50 -b 50 -I queryseq -o results 
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-p Program Name [String] 

-d Database [String] 

-e Expectation value (E) [Real] 

-G Cost to open a gap (zero invokes default behaviour) [Integer] 
5 -E Cost to extend a cap (zero invokes default behaviour) [Integer] 
-v Number of one-line descriptions (v) [Integer] 
-b Number of alignments to show (b) [Integer] 
-i Query File [File In] 

-o BLAST report Output File [File Out] Optional 

10 

The "hits" to one or more database sequences by a queried sequence produced by 
BLASTN, BLASTP, FASTA, or a similar algorithm, align and identify similar portions 
of sequences. The hits are arranged in order of the degree of similarity and the 
length of sequence overlap. Hits to a database sequence generally represent an 
1 5 overlap over only a fraction of the sequence length of the queried sequence. 

The BLASTN and FASTA algorithms also produce "Expect" or E values for 
alignments. The E value indicates the number of hits one can "expect" to see over a 
certain number of contiguous sequences by chance when searching a database of a 

20 certain size. The Expect value is used as a significance threshold for determining 
whether the hit to a database, such as the preferred EMBL database, indicates true 
similarity. For example, an E value of 0.1 assigned to a hit is interpreted as 
meaning that in a database of the size of the EMBL database, one might expect to 
see 0. 1 matches over the aligned portion of the sequence with a similar score simply 

25 by chance. By this criterion, the aligned and matched portions of the sequences 
then have a 90% probability of being the same. For sequences having an E value of 
0.01 or less over aligned and matched portions, the probability of finding a match by 
chance in the EMBL database is 1% or less using the BLASTN or FASTA algorithm. 

30 According to one embodiment, "variant" polynucleotides, with reference to each of 
the polynucleotides of the present invention, preferably comprise sequences having 
the same number or fewer nucleic acids than each of the polynucleotides of the 
present invention and producing an E value of 0.01 or less when compared to the 
polynucleotide of the present invention. That is, a variant polynucleotide is any 

35 sequence that has at least a 99% probability of being the same as the 
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polynucleotide of the present invention, measured as having an E value of 0.01 or 
less using the BLASTN or FASTA algorithms set at the parameters discussed above. 

Variant polynucleotide sequences will generally hybridize to the recited 
5 polynucleotide sequence under stringent conditions. As used herein, "stringent 
conditions 3 refers to prewashing in a solution of 6X SSC, 0.2% SDS; hybridizing at 
65°C, 6X SSC, 0.2% SDS overnight; followed by two washes of 30 minutes each in 
IX SSC, 0.1% SDS at 65°C and two washes of 30 minutes each in 0.2X SSC, 0.1% 
SDS at 65°C. 

10 

The superantigens of the invention together with their fragments and other variants 
may be generated by synthetic or recombinant means. Synthetic polypeptides 
having fewer than about 100 amino acids, and generally fewer than about 50 amino 
acids, may be generated by techniques well known to those of ordinary skill in the 

15 art. For example, such peptides may be synthesised using any of the commercially 
available solid- phase techniques such as the Merryfield solid phase synthesis 
method, where amino acids are sequentially added to a growing amino acid chain 
(see Merryfield, J. Am. Chem. Soc 85: 2146-2149 (1963)). Equipment for automative 
synthesis of peptides is commercially available from suppliers such as Perkin 

20 Elmer/ Applied Biosy stems, Inc. and may be operated according to the 
manufacturers instructions. 

Each superantigen, or a fragment or variant thereof, may also be produced 
recombinantly by inserting a polynucleotide (usually DNA) sequence that encodes 

25 the superantigen into an expression vector and expressing the superantigen in an 
appropriate host. Any of a variety of expression vectors known to those of ordinary 
skill in the art may be employed. Expression may be achieved in any appropriate 
host cell that has been transformed or transfected with an expression vector 
containing a DNA molecule which encodes the recombinant protein. Suitable host 

30 cells includes procaryotes, yeasts and higher eukaryotic cells. Preferably, the host 
cells employed are £. co/t, yeasts or a mammalian cell line such as COS or CHO, or 
an insect cell line, such as SF9, using a baculovirus expression vector. The DNA 
sequence expressed in this matter may encode the naturally occurring 
superantigen, fragments of the naturally occurring protein or variants thereof. 
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DNA sequences encoding the superantigen or fragments may be obtained, for 
example, by screening an appropriate S. pyogenes cDNA or genomic DNA library for 
DNA sequences that hybridise to degenerate oligonucleotides derived from partial 
amino acid sequences of the superantigen. Suitable degenerate oligonucleotides 
may be designed and synthesised by standard techniques and the screen may be 
performed as described, for example, in Maniatis et al Molecular Cloning - A 
Laboratory Manual, Cold Spring Harbour Laboratories, Cold Spring Harbour, NY 
(1989). 

Identification of these superantigens and of their properties gives rise to a number 
of useful applications. A first such application is in the genotyping of organisms by 
reference to their superantigen profile. 

An illustration of this is subtyping of strains of S. pyogenes. 

One feature which has been observed is that all clones of S. pyogenes so far found 
to be positive for SMEZ express either SMEZ- 1 or SMEZ-2 but not both. Thus they 
are mutually exclusive, which enables a rapid diagnostic test which tells whether an 
isolate or a patient sample is either SMEZ-1 +ve or SMEZ-2 +ve. This will assist in 
the typing of the isolate. 

This general diagnostic approach is most simply achieved by providing a set or 
primers which amplify either all or a subset of superantigen genes and that 
generate gene specific fragments. This can be modified to provide a simple 
qualitative ELISA-strip type kit that detects biotin labelled PCR fragments amplified 
by the specific primers and hybridised to immobilised sequence specific probes. 
This has usefulness for screening patient tissue samples for the presence of 
superantigen producing streptococcal strains. 

Such approaches are well known and well understood by those persons skilled in 
the art. 

Another approach is to provide monoclonal antibodies to detect each of the 
streptococcal superantigens. An ELISA kit containing such antibodies would allow 
the screening of large numbers of streptococcal isolates. A kit such as this would be 
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useful for agencies testing for patterns in streptococcal disease or food poisoning 
outbreaks. 

Another potential diagnostic application of the superantigens of the invention is in 
5 the diagnosis of disease, such as Kawasaki Syndrome (KS). 

KS is an acute multi- system vasculitis of unknown aetiology. It occurs world-wide 
but is most prevalent in Japan or in Japanese ancestry. It primarily affects infants 
and the young up to the age of 16. It is an acute disease that without treatment, 
10 can be fatal. Primary clinical manifestations include 

• Prolonged fever 

• Bilateral non-exudative conjunctivitis 

• Indurtation and erythema of the extremities 

• Inflammation of the lips and oropharynx 
15 • Polymorphous skin rash 

• Cervical lymphoadenopathy 

• In 15-25% of cases, coronary arterial lesions develop. 
These indications are used as a primary diagnosis of KS. 

20 

In Japan and the US, KS has become one of the most common causes of acquired 
heart disease in children. Treatment involves the immediate intravenous 
administration of gamma globulin (IVGG) during the acute phase of the disease and 
this significantly reduces the level of coronary lesions. 

25 

There are two clear phases to the disease, an acute phase and a convalescent 
phase. The acute phase is marked by strong immune activation. Several reports 
have suggested that superantigens are involved and many attempts have been 
made to link the disease to infection with superantigen producing strains of 
30 Streptococcus pyogenes. Features of the acute phase of KS are the expansion of Vp 
2 and to a lesser extent VP8 bearing T cells and an increase of DR expression T cells 
(a hallmark of T cell activation). 

Because SMEZ-2 stimulates both Vp2 and Vp8 bearing T cells, testing for SMEZ-2 
35 production is potentially very useful in the diagnosis of KS. 
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Antibodies to the superantigens for use in applications such as are described above 
are also provided by this invention. Such antibodies can be polyclonal but will 
preferably be monoclonal antibodies. 

5 

Monoclonal antibodies with affinities of 10 s M" 1 or preferably 10- 9 to 10*° M- 1 or 
stronger will typically be made by standard procedures as described, eg. in Harlow & 
Lane (1988) or Goding (1986). Briefly, appropriate animals will be selected and the 
desired immunization protocol followed. After the appropriate period of time, the 
10 spleens of such animals are excised and individual spleen cells fused, typically, to 
imm ortalised myeloma cells under appropriate selection conditions. Thereafter, the 
cells are clonally separated and the supernatants of each clone tested for then- 
production of an appropriate antibody specific for the desired region of the antigen. 

15 Other suitable techniques for preparing antibodies well known in the art involve in 
vitro exposure of lymphocytes to the antigenic polypeptides, or alternatively, to 
selection of libraries of antibodies in phage or similar vectors. 

Also, recombinant immunoglobulins may be produced using procedures known in 
20 trie art (see, for example, US Patent 4,816,567 and Hodgson J. (1991)). 

The antibodies may be used with or without modification. Frequently, antibodies 
will be labeled by joining, either covaleritly or non-covalently, a substance which 
provides for a detectable signal. A wide variety of labels and conjugation techniques 
25 are known and are reported extensively in the literature. Suitable labels include 
radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, 
chemiluminescent agents, magnetic particles and the like. Patents teaching the use 
of such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
4,277,437; 4,275,149; and 4,366,241. 

30 

The immunological assay in which the antibodies are employed can involve any 
convenient format known in the art. 



35 



The nucleotide sequence information provided herein may be used to design probes 
and primers for probing or amplification of parts of the smez-2, spe-g, spe-h and 
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spe-j genes. An oligonucleotide for use in probing or PCR may be about 30 or fewer 
nucleotides in length. Generally, specific primers are upwards of 14 nucleotides in 
length. For optimum specificity and cost effectiveness, primers or 16-24 nucleotides 
in length are preferred. Those skilled in the art are well versed in the design of 
5 primers for use in processes such as PCR. 

If required, probing can be done with entire polynucleotide sequences provided 
herein as SEQ ID NOS 1, 3, 5 and 7, optionally carrying revealing labels or reporter 
molecules. 

10 

O Such Probes and primers also form aspects of the present invention. 



OJ Probing may employ the standard Southern blotting technique. For instance, DNA 

U may be extr acted from cells and digested with different restriction enzymes. 

Jfj 15 Rest riction fragments may then be separated by electrophoresis on an agarose gel, 

r before denaturation and transfer to a nitrocellulose filter. Labelled probes may be 

O hybridised to the DNA fragments on the filter and binding determined. DNA for 

flj probing may be prepared from RNA preparations from cells. Probing may optionally 

2 be done bv means of so-called "nucleic acid chips' (see Marshall and Hodgson 
20 (1998) Nature Biotechnology 16:27-31). 



In addition to diagnostic applications, another application of the superantigens is 
reliant upon their ability to bind to other cells. 



25 One of the most important features of superantigens is that they bind a large 
number or T cell receptor molecules by binding to the Vp domain. They are the 
most potent of all T cell mitogens and are therefore useful to recruit and activate T 
cells in a relatively non-specific fashion. 

30 This ability enables the formation of constructs in which the superantigen (or at 
least the T-cell binding portion of it) is coupled to a cell-targeting molecule, 
particularly an antibody, more usually a monoclonal antibody. 



35 



When a monoclonal antibody that targets a specific cell surface antigen (such as a 
tumor specific antigen) is coupled to a superantigen in such a construct, this 
generates a reagent that on the one hand will bind specifically to the tumor cell, and 
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on the other hand recruit and selectively active T cells for the purpose of killing the 
targeted cell. 

Bi-specific constructs of this type have important applications in therapy 
5 (particularly cancer therapy) and again may be prepared by means known to those 
skilled in art. For example SMEZ-2 may be coupled to a tumor specific monoclonal 
antibody. The constructs may be incorporated into conventional carriers for 
pharmaceuticaily- active proteins. 

10 Various aspects of the invention will now be described with reference to the 
following experimental section, which is included for illustrative purposes. 

EXAMPLE 

15 SECTION A: SUPERAWTIGEN IDENTIFICATION AND CHARACTERISATION 
Materials and. Methods 



20 
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Identification of novel SACs 

The novel superantigens were identified^by^jfia^faiiirinT^ pyogenes Ml genome 
database at the University^pi^lcIah^mT (http://www.genome.ou.edu/strep.html) 
with hightycorisert?eap5 and a4regions of streptococcal and staphylococcal 
superjaHtigens, using a TBlastN search programme. 



25 



30 



The open reading frames were defined by translating the DNA sequences around the 
matching regions and aligning the protein sequences to known superantigens using 
the computer programes Gap. Multiple alignments and dendrograms were 
performed with Lineup and Pileup. The FAST A programme was used for searching 
the SwissProt (Amos Bairoch, Switzerland) and PIR (Protein Identification Resource, 
USA) protein databases. 



The leader sequences of SPE-G and SPE-H were predicted using the SP Scan 
programme All computer programmes are part of the GCG package (version 8). 
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Cloning ofsmez, smez-2, spe-g and spe-h. *r 
Fifty nanograms of S.pyogenes Ml (ATCC 700294) or S. pyogenes genomic DNA 

was used as a template to amplify the smez DNA fragmeprtfand the smez-2 DNA 
fragment, respectively, by PCR using the primers y< 
5 smez-forward (TG GG ATC CTT AG AAGTAG AT AAT A) arfd 

smez- reverse (AAGAATTCTTAGGAGTCAATTTp^and Taq Polymerase (Promega). The 
primers contain a terminal tag with thexfestriction enzyme recognition sequences 
BamHI and EcoRI, respectively. The/amplified DNA fragment, encoding the mature 
protein without the leader seq*fence (Kamezawa et al, 1997 Infect. Immun. 65 
10 no9:38281-33) was cloned in*o a T-tailed pBlueScript SKU vector (Stratagene). 

Spe-g and spe-h w^are cloned in a similiar approach, using the primers spe-g- fw 
(CTGGATCCG^raAAAATTTAAAAGATTTAA) and spe-g-rev 
(AAGAATpefGGGGGGAGAATAG), and primers spe-h- fw 
15 (TTGja^TC C AATTCTTATAATAC AAC C) and spe-h-rev 
(AAAAGCT1T 1' AGCTG ATTG AC AC) , respectively. 

The DNA sequences of the subcloned toxin genes were confirmed by the dideoxy 
chain termination method using a Licor automated DNA sequencer. As the DNA 
20 sequences from the genomic database are all unedited raw data, 3 subclones of 
every cloning experiment were analyzed to ensure that no Taq polymerase related 
mutations were introduced. 

Expression and purification of rSMEZ, r SMEZ-2, rSPE-G and rSPE-H. 

25 Subcloned smez, smez-2 and spe-g fragments were cut from pBlueScript SKU 
vectors, using restriction enzymes BamHI and EcoRI (LifeTech), and cloned into 
pGEX-2T expression vectors (Pharmacia). Due to an internal EcoRI restriction site 
within the spe-H gene, the pBlueScript: spe-h subclone was digested with BamHI 
and Hindlll and the spe-h fragment was cloned into a modified pGEX-2T vector that 

30 contains a Hindlll 3'cloning site. 

Recombinant SMEZ, rSMEZ-2 and rSPE-H were expressed in E.coli DH 5a cells as 
glutathione-S-tranferase (GST) fusion proteins. Cultures were grown at 37° C and 
induced for 3-4 h after adding 0.2 mM isopropyl-p-D-thiogalactopyranoside (IPTG). 
35 GST - SPE-G fusion protein was expressed in cells grown at 28° C. 
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The GST fusion proteins were purified on glutathione agarose as described 
previously (Li et al, 1997, and the mature toxins were cleaved off from GST by 
trypsm digestion. All recombinant toxins, except rSMEZ, were further purified by 
rwo rounds of cation exchange chromatography using carboxy methyl sepharose 
(Pharmacia). The GST-SMEZ fusion protein was trypsin digested on the GSH-column 
and the flow through containing the SMEZ was collected. 

Gel electrophoresis 

All purified recombuxant toxins were tested on a 12% SDS-polyacrvlamide gel 
according the procedure of Laemmli. The isoelectric point of the recombinant toxins 
was determined by isoelectric focusing on a 5.5% polyacrylamide gel using 
ampholine P H 5-8 (Pharmacia Biotech,. The gel was run for 90 min at 1 W constant 
power. 

Toxin proliferation assay 

Human peripheral blood lymphocytes (PBL, were purified from blood of a healthy 
donor by Histopaque Ficoll (Sigma, fractionation. The PBL were incubated in 96-well 
round bottom microtiter plates at l 0 s cells per well with RPMI-10 (RPMI with 10% 
fetal calf serum, containing varying dilutions of recombinant toxins. The dilution 
senes was performed in 1:5 steps from a starting concentration of 10 ng/ml of toxin 
Pipette txps were changed after each dilution step. After 3 days 0 1 »Ci 
PHJthymidine was added to each well and cells were incubated for another 24 h 
Cells were harvested and counted on a scintillation counter. 



Mouse leukocytes were obtained from spleens of 5 different mouse strains (SJL 
B10.M, B10/J, C3H and BALB/CJ. Splenocytes were washed in DMEM-10, counted 
m 5% aceuc acid and incubated on microtiter plates at IQS ceUs per well with 
DMEM-10 and toxins as described for human PBLs. 
30 TcR Vp analysis. 

V P enrichment analysis was performed by anchored multiprimer amplification 
(Hudson et al, 1993, J exp Med 177:175-185). Human PBLs were incubated with 20 
Pg/ml of recombinant toxin at 10* cells/ml for 3 d. A two-fold volume expansion of 
the culture followed with medium containing 20 ng/ml IL-2. After another 24h 
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stimulated and resting cells were harvested and RNA was prepared using Trizol 
reagent (Life Tech). A 500 bp p-chain DNA probe was obtained by anchored 
multiprimer PCR as described previously (38), radiolabeled and hybridized to del (36) 
individual Vp s and a Cp DNA region dot blotted on a Nylon membrane. The 
membrane was analysed on a Molecular Dynamics Storm Phosphor imager using 
ImageQuant software. Individual Vf3s were expressed as a percentage of all the V(3s 
determined by hybridization to the Cp probe. 

Jurknt cell assay 

Jurkat cells (a human T cell line) and LG-2 cells (a human B lymphoblastoid cell 
line, homozygous for HLA-DR1) were harvested in log phase and resuspended in 
RPMI-10. One hundred microliter of the cell suspension, containing 1x10= Jurkat 
cells and 2x10* LG-2 cells were mixed with 100 ^1 of varying dilutions of 
recombinant toxins on 96 well plates. After incubating overnight at 37° C, 100 jd 
aliquots were transfered onto a fresh plate and 100 nl (lxlC) of Sel cells (IL-2 
dependent murine T cell line) per well were added. After incubating for 24 h, 0. 1 ^Ci 
PH]thymidine was added to each well and cells were incubated for another 24 h. 
Cells were harvested and counted on a scintillation counter. As a control, a dilution 
series of IL-2 was incubated with Sel cells. 

Computer aided modelling of protein structures 

Protein structures of SMEZ-2, SPE-G and SPE-H were created on a Silicon Graphics 
computer using Insightll/Homology software. The superantigens SEA, SEB and 
SPE-C were used as reference proteins to determine structurally conserved regions 
(SCRs). Coordinate files for SEA (1ESF), for SEB (1SEB) and for SPE-C (1AN8) were 
downloaded from the Brookhaven Protein Database. The primary amino acid 
sequences of the reference proteins and SMEZ-2, SPE-G and SPE-H, respectively, 
were aligned and coordinates from superimposed SCR's were assigned to the model 
proteins. The loop regions between the SCRs were generated by random choice. 
MolScript software (PJ Kraulis, 1991, J App Critallography 24:946-50) was used for 
displaying the computer generated images. 



Radiolabeling and LG-2 binding experiments 

Recombinant toxin was radioiodinated by the chloramine T method as previously 
35 described (by Li et al. 1997). Labeled toxin was seperated from free iodine by size 
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exclusion chromatography using Sephadex G25 (Pharmacia). LG2 cells were used 
for cell binding experiments, as described (Li et al. 1997). Briefly, cells were 
harvested, resuspended in RPMI-10 and mixed at 10* cells/ml with i^-tracer toxin 
(1 ng) and 0.0001 to 10 ug of unlabeled toxin and incubated at 37° C for 1 h. After 
washing with ice cold RPMI- 1 the pelleted cells were analyzed in a gamma counter. 
For zinc binding assays the toxins were incubated in either RPMI-10 alone, in RPMI- 
10 with 1 mM EDTA or in RPMI-10 with ImM EDTA, 2 mM ZnCl 2 . 

Scatchard analysis was performed as described by Cunningham et al. (1989). For 

competitive binding studies, 1 ng of i**- tracer toxin (rSMEZ, rSMES-2, rSPE-G, 

rSPE-H, rSEA, rSPE-C, or rTSST) was incubated with 0.0OO1 to 10 ng of unlabeled 

Jg toxin (rSMEZ, rSMES-2, rSPE-G, rSPE-H, rSEA, rSEB, rSPE-C, and rTSST) for Ih. 

J* F ° r SEB inhibition studies, 20 ng of i^I-rSEB was used as tracer and samples were 

incubated for 4h. 
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Results 

Identification and sequence analysis of superantigens. 

The Oklahoma University Streptococcus pyogenes Ml genome database is accessible 
via the internet and contains a collection of more than 300 DNA sequence contigs 
derived from a shot gun plasmid library of the complete S. pyogenes Ml genome. 
The currently available DNA sequences cover about 95% of the total genome. This 
database was searched with a highly conserved superantigen peptide sequence, 
using a search program that screens the DNA database for peptide sequences in all 
6 possible reading frames. 8 significant matches and predicted the open reading 
frames (ORFs) were found by aligning translated DNA sequences to complete protein 
sequences of known SAgs. 

Five matches gave complete ORFs with significant homology to streptococcal and 
staphylococcal superantigens. Three of these ORFs correlate to SPE-C, SSA and the 
recently described SMEZ (Kamezawa et al. 1997), respectively. The remaining two 
ORFs could not be correlated to any known protein in the SwissProt and PIR 
databases. These novel putative superantigen genes were named spe-g and spe-h 
(see Figs 3 and 4). One ORF could not be generated completely due to its location 
close to the end of a contig. The DNA sequence of the missing 5'-end is located on 
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another contig, and individual contigs have yet to be be assembled in the database. 
However, the available sequence shows an ORF for the 137 GOOH-terminal amino 
acid residues of a putative novel superantigen which could not be found in the 
existing protein databases. This gene was named spe-j (see Fig. 5). 

5 

In two cases a complete ORF could not be defined due to severed out-of-frame 
mutations. Although DNA sequencing errors on the unedited DNA sequences 
cannot be completely ruled out, the high frequency of inserts and deletions probably 
represent natural mutation events on pseudogenes, which are no longer used. 

10 

To produce recombinant proteins of SMEZ, SPE-G and SPE-H, individual genes 
(coding for the mature toxins without leader sequence) were amplified by PCR, and 
subcloned for DNA sequencing. Both, Str. pyogenes Ml and Str. pyogenes 2035 
genomic DNA were used and individual toxin gene sequences compared between the 

15 two strains. The spe-h gene was isolated from Ml strain, but could not be amplified 
from strain 2035 genomic DNA suggesting a restricted strain specificity for this 
toxin. The spe-g gene was cloned from both Ml and 2035, and DNA sequence 
analysis of both genes showed no differences. The full length smez gene was isolated 
from both strains, but DNA sequence comparison revealed some striking 

20 differences. The smez gene of strain 2035 showed nucleotide changes in 36 
positions (or 5%) compared to smez from strain Ml (Fig. 1). The deduced protein 
sequences differed in 17 amino acid residues (or 8.1%). This difference was 
sufficient to indicate a new gene. This gene was named smez-2, because it is 95% 
homologous to smez (see Fig. 2). 

25 

The most significant difference between SMEg^aKd^SMEZ-2 is an exchanged 
pentapeptide sequence at position9£^wWKl7where the EEPMS sequence of SMEZ is 
converted to KTSIL in SMg^^ig. 1). A second cluster is at position 111-112, where 
an RR dipepticle-^isexchanged for GK in SMEZ-2. The remaining 10 different 
30 resjjiwesare spread over almost the entire primary sequence. 



A revised superantigen family tree, based on primary amino acid sequence 
homology now shows 3 general subfamilies; group A comprises SPE-C, SPE-J, SPE- 
G, SMEZ and SMEZ-2, group B comprises SEC 1-3, SEB, SSA, SPE-A and SEG and 
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group C comprises SEA, SEE, SED, SEH and SEI. Two superantigens, TSST and 
SPE-H do not belong to any one of those subfamilies. 

SMEZ, SMEZ-2, SPE-G and SPE-J are most closely related to SPE-C, increasing the 
5 number of this subfamily from 2 to 5 members. SPE-G shows the highest protein 
sequence homology with SPE-C (38.4% identity and 46.6% similarity). The 
homology of SPE-J to SPE-C is even more significant (56% identity and 62% 
similarity), but this comparison is only preliminary due to the missing NH 2 -terminal 
sequence- SMEZ shows 30.9% / 40.7% homology to SPE-C and SMEZ-2 is 92% / 
10 93% homologous to SMEZ. 



SPE-H builds a new branch in the family tree and is most closely related to SED, 
showing 25% identity and 37.3% similarity. 

15 Multiple alignment of SAg protein sequences (Fig. l)sJ^ow^that similarities are 
clustered within structure determining regicm^fepresented by ot4, a.5, p4 and (35 
regions. This applies to all to^n^jPirKesuperantigen family (data not shown) and 
explains whysupw^iitigeiislike SPE-C and SEA have very similar overall structures 
despite'toeir rather low sequence identity of 24.4 %. 
20 

Although SPE-H is less related to SPE-C it shows 2 common features with the "SPE- 
C subfamily": (I) a truncated NH 2 -terminus, lacking the al region and (II) a primary 
zinc binding motif (H-X-D) at the C -terminus (Fig. 1). It has been shown for several 
superantigens that this motif is involved in a zinc coordinated binding to the p-chain 
25 of HLA-DR1. 



Fusion proteins of GST-SMEZ, GST-SMEZ-2 and GST-SPE-H were completely 
soluble and gave yields of about 30 mg per liter. The GST-SPE-G fusion was 
insoluble when grown at 37° C, but mostly soluble when expressed in cells growing 
30 at 28° C. Although soluble GST-SPE-G yields were 20-30 mg per liter, solubility 
decreased after cleavage of the fusion protein with trypsin. Soluble rSPE-G was 
achieved by diluting the GST-SPE-G to less than 0.2 mg/ml prior to cleavage. After 
cation exchange chromatography, purified rSPE-G could be stored at about 0.4 
mg/ ml. 



35 
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Recombinant SME2 could not be separated from GST by ion exchange 
chromatography. Isoelectric focusing revealed that the isoelectric points of the two 
proteins are too similar to allow separation {data not shown). Therefore, rSMEZ was 
released from GST by cleaving with trypsin while still bound to the GSH agarose 
5 column. Recombinant SMEZ was collected with the flow through. 

The purified recombinant toxins were applied to SDS-PAGE and isoelectric focusing 
(Fig. 6). Each toxin ran as a single band on the SDS PAA gel confirming their purity 
and their calculated molecular weights of 24.33 (SMEZ), 24.15 (SMEZ-2), 24.63 
10 (SPE-G) and 23.63 (SPE-H) (Fig. 6A). The isoelectric focusing gel (Fig. 6B) shows a 
significant difference between rSMEZ and rSMEZ-2. Like most other staphylococcal 
and streptococcal toxins, rSMEZ-2 possesses a slightly basic isoelectric point at pH 
7-8, but rSMEZ is acidic with an IEP at pH 6-6.5. 

15 T cell proliferation and VJ3 specificity 

To ensure the native conformation of the purified recombinant toxins, a standard 
[ 3 H]thymidine incorporation assay was performed to test for their potency to 
stimulate peripheral blood lymphocytes (PBLs). All toxins were active on human T 
cells (Fig. 7). Recombinant SEA, rSEB, rSPE-C and rTSST were included as 

20 reference proteins. The mitogenic potency of these toxins was lower than described 
previously, but is regarded as a more accurate figure. In previous studies, a higher 
starting concentration of toxin (100 ng/ml) was used and tips were not changed in 
between dilutions. This led to significant carryover across the whole dilution range. 
On this occasion, the starting concentration was 10 ng/ml and tips were changed in 

25 between dilutions preventing any carryover. 

The half maximal response for rSPE-G and rSPE-H was 2 pg/ml and 50 pg/ml, 
respectively. No activity was detected at less than 0.02 pg/ml and 0.1 pg/ml, 
respectively. Both toxins are therefore less potent than rSPE-C. Recombinant SMEZ 
30 was similar in potency to rSPE-C, with a P 5 o% value of 0.08 pg/ml and no detectable 
proliferation at less than 0.5 fg/ml. Recombinant SMEZ-2 showed the strongest 
mitogenic potency of all toxins tested or, as far as can be determined, described 
elsewhere. The Pso% value was determined at 0.02 pg/ml and rSMEZ-2 was still 
active at less than 0. 1 pg/ml. All P5o% values are summarized in Table 1. 
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TABLE 1 



POTENCY OF RECOMBINANT TOXINS ON HUMAN AND MOUSE T CELLsT 
PROLIFERATION POTENTIAL P 50 % [pg/ml] 



TOXIN 



HUMAN 



SJL 



B10.M 



BIO/ J 



C3H 



BALB/C 



SEA 


0.1 


20 


12 


1.8 


19 


1000 


SEE 


0.2 


10 


12 


1.5 


50 


15 


SEB 


0.8 


7000 


80,000 


5000 


10,000 


1000 


TSST 


0.2 


20 


1000 


1.2 


100 


10 


SPE-C 


0.1 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


SMEZ 


0.08 


80 


80 


100 


9000 


200 


SMEZ-2 


0.02 


100 


15 


10 


800 


18 


SPE-G 


2 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


SPE-H 


50 


15 


800 


5000 


100 


1000 



Human PBLs and mouse T cells were stimulated with varying amounts of recombinant toxin. The value 
reflects «ne concentration of recombinant toxin required to induce 50% maxima, cell proliferation. No 
proliferation was detected for rSPE-C and rSPE-G at any concentration tested on murine T cells 
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Murine T cells from 5 different mouse strains were tested for their mitogenic 
response to rSMEZ, rSMEZ-2, rSPE-G and rSPE-H (Table 1). Recombinant SPE-G 
showed no activity against any of the mouse strains tested. Recombinant SPE-H, 
rSMEZ and rSMEZ-2 showed varied potency depending on the individual mouse 
5 strain. For example, rSMEZ-2 was 500-fold more potent than rSPE-H in the BIO/ J 
strain, while rSPE-H was 7. 5- fold more active than rSMEZ-2 in the SJL strain. 

The most consistently potent toxiri on murine T cells was rSMEZ-2 with Pso% values 
of 10 pg/ml in BIO/ J and 800 pg/ml in C3H. Recombinant SMEZ varied between 80 
10 pg/ml in SJL and B10.M and 9000 pg/ml in C3H. The Pso% value for rSPE-H was 
between 15 pg/ml in SJL and 5000 pg/ml in B10/ J. 
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TABLE 2 



Vp SPECIFICITY OF RECOMBINANT TOXINS ON HUMAN PELS. 



PERCENT Vp ENRICHMENT 


vp 


Resting 


SMEZ 






SPE-H 


1.1 


0 2 






1.2 


1 


2. 1 


0.4 


O. *T 


1 


17.9 


8.6 


3.2 


4 ft 


1 1 

O. A 


2.5 


3 


2.4 


4. 1 




-£*r. o 


14.4 


1 1.2 


5.2 


5.1 


6.2 


1 4 


z, . o 


D.7 


2.2 


5.3 


5.6 


z. . SL, 


/I t 

*f . 1 


4.7 


4.1 


6.3 


O 


U.O 


2.3 


4.7 


3.5 


6.4 






5.9 


9.6 


5.6 


6.9 


6.9 


3.5 


9.3 


19.1 


12.2 


7.3 


3.5 


15.3 


7.3 


3.2 


12.6 


7.4 


9 


13.5 


11.7 


2.9 


6.3 


8.1 


8.7 


20.7 


36 


4.5 


2.4 


9.1 


0.3 


0.05 


O 


1.2 


2.3 


12.3 


0.8 


1.6 


2 


3.2 


2.6 


12.5 


3 


1.2 


2 


3 


2.3 


15.1 


0.6 


0.5 


0.7 


1.2 


0.8 


23.1 


0.2 


0.1 


0.3 


0.8 


I 


total 


62.1 


99.7 


102.8 


97.1 


75.2 



Human PBLs were incubated with 20 pg/ml of recombinant toxin for 4d. Relative enrichment of Vp cDNAs was 
analyzed from RNA of stimulated and re ting PBLs by anchored primer PGR and reverse dot blot to a panel of 17 
different Vp cDNAs. 



The values representing the highest Vp enrichment arc underlined. 
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The human TcR Vp specificity of the recombinant toxins was determined by 
multiprimer anchored PCR and dot blot analysis using a panel of 17 human V|3 DNA 
regions. The Vp enrichment after stimulation with toxin was compared to the VP 
profile of unstimulated PBLs (Table 2). The sum total of all VPs stimulated by 
5 rSMEZ, rSMEZ-2 and rSPE-G was close to 100 % suggesting that the Vps used in 
the panel represent all the targeted Vps. On the other hand, the total of the V£s 
stimulated by rSPE-H was only 75%. It is therefore likely that rSPE-H also 
stimulated some less common Vps, which are not represented in the panel. The 
most dramatic response was seen with all toxins, except rSMEZ2, on Vp2.1 bearing 

10 T cells (21-fold for rSMEZ, 45-fold for rSPE-G and 22-fold for rSPE-H). In contrast, 
rSMEZ2 gave only a 2.5-fold increase of Vp2.1 T-cells. SPE-G also targeted Vp4. 1, 
Vp6.9, VP9.1 and Vpi2.3 (3-4 fold). A moderate enrichment of V012.6, Vp9.1 and 
VP23.1 (4-8 fold) was observed with rSPE-H. Both, rSMEZ and rSMEZ2, targeted 
VP4.1 and Vp8. 1 with similiar efficiency (3-7-fold). This finding is of particular 

15 interest, because VP8.1 activity had been found in some, but not all Str. pyogenes 
culture supernatants and in crude preparations of SPE-A and SPE-C. Moreover, 
SPE-B has often been claimed to have Vp8 specific activity, but has since been 
shown to be a contaminant previously called SpeX. The ability of rSMEZ and 
rSMEZ-2 to stimulate the Vp8. 1 Jurkat cell line was tested (Fig. 8) Recombinant 

20 SMEZ was less potent than the control toxin (rSEE), showing a half maximal 
response of 0.2 ng/ml, compared to 0.08 ng/ml with rSEE, but rSMEZ-2 was more 
potent than rSEE (0.02 ng/ml). No proliferation activity was observed with the 
negative control toxin rSEA. 



25 MHC class E binding 

To determine if there were significant structural differences, the protein structures 
of SMEZ- 2, SPE-G and SPE-H were modelled onto the superimposed structurally 
conserved regions of SEA, SEB and SPE-C. The models showed that in all three 
proteins, the 2 amino acid side chains of the COOH-terminal primary zinc binding 

30 motif are in close proximity to a third potential zinc ligand to build a zinc binding 
site, similar to the zinc binding site observed in SEA and SPE-C. 

The zinc binding residues in SPE-C are H167, H201, D203, and it is thought that 
H81 from the HLA-DR1 P- chain binds to the same zinc cation to form a regular 
35 tetrahedral complex. The two ligands of the primary zinc binding motif, H201 and 
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D203, are located on the pl2 strand, which is part of the p-grasp motif, a common 
structural domain of superantigens. The third ligand, H167, comes from the piO 
strand (Roussel et al. 1997). 

5 In the model of SPE-G three potential zinc binding ligands (H167, H202 and D204) 
are located at corresponding positions. In the SMEZ-2 and the SPE-H models, the 
two corresponding p!2 residues are H202, D204 and H198, D200, respectively. The 
third ligand in SPE-H (D160) and in SMEZ-2 (H162) comes from the (39 strand and is 
most similar to H187 in SEA. It has been shown from crystal structures that H167 
10 of SPE-C and H187 of SEA are spatially and geometrically equivalent sites (Scad et 
al. 1997, Embo J 14 no 14:3292-301; Roussel et al. 1997). 

All superantigens examined so far, except SPE-C, bind to a conserved motif in the 
MHC class II al -domain. In SEB and TSST, hydrophobic residues on the loop 

15 between the pi and p2 strand project into a hydrophobic depression in the MHCII 
a 1 -domain. This loop region has changed its character in SPE-C, where the 
hydrophobic residues ( F44, L45, Y46 and F47 in SEB) are substituted by the less 
hydrophobic residues T33, T34 and H35. A comparison of this region on the 
computer generated models revealed that the generic HLA-DR1 a-chain binding site 

20 might also be missing. As the loop regions are generated by random choice, no 
conclusions can be drawn from their conformation in the models. However, in none 
of the three models does the pi-p2-loop have the required hydrophobic features 
observed in SEB and TSST Swaminathan, S. et al., Nature 359, No. 6398:801-6 
(1992), Acharya et al., Nature 367, No. 6458: 94-7 (1994). The residues are 125, D26, 

25 F27, K28, T29 and S30 in SMEZ-2, T31, T32, N33, S34 in SPE-G and K28, N29, S30, 
P31, D32, 133, V34 and T35 in SPE-H. 

SMEZ-2 differs from SMEZ in only 17 amino acids. In the model of SMEZ-2 with the 
position of those 17 residues, most of the exchanges are located on loop regions, 
30 most significantly on the P5-P6 loop with 5 consecutive residues replaced. The 
potential zinc binding site and the P1-P2 loop are not affected by the replaced amino 
acids. 

The TcR Vp specificity differs between SMEZ and SMEZ-2 by one Vp. SMEZ strongly 
35 stimulates Vp2 T cells, but SMEZ-2 does not (Table 2). One or more of the 17 
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exchanged residues in SMEZ/SME2-2 may therefore be directly involved in TcR 
binding. The exact position of the TcR binding site can not be predicted from the 
model as several regions have been implicated in TcR binding for different toxins. 
Crystal structures of SEC2 and SEC3, complexed with a TcR P-chain indicated the 
5 direct role of several residues located on cl2, the £2-03 loop, the P4-P5 loop and a4 
(Fields et al. 1996 Nature 384 no 6605:188-92). On the other hand, binding of TSST 
to the TcR involves residues from a4, the (37-P8 loop and the <x4-p9 loop (Acharya et 
al. 1994, Nature 367 no 6548:94-7). The SMEZ-2 model shows 3 residues, which 
may contribute to TcR binding. In SMEZ, Lys is exchanged for Glu at position 80 
10 and Thr is exchanged for He at position 84, both on the P4-P5 loop. On the COOH- 
terminai end of the a4 helix, Ala is replaced by Ser at position 143. 

The results from the computer modelled protein structures suggest that all 4 toxins, 
SMEZ, SMEZ-2, SPE-G and SPE-H, might bind to the HLA-DR1 p-chain in a zinc 
15 dependent fashion, similar to SEA and SPE-C, but might not be able to interact with 
the HLA-DR1 oc-site, a situation that has so far only been observed with SPE-C 
(Roussel et al. 1997; Li et al. 1997). 

To find out whether or not zinc is required for binding of the toxins to MHC class II, 
20 a binding assay was performed using human LG-2 cells (which are MHC class II 
expressing cells homozygous for HLA-DR1). Direct binding of 125 I-labeled toxins was 
completely abolished in the presence of 1 mM EDTA (Fig. 9, Table 3). When 2 mM 
ZnCh was added, binding to the LG-2 cells could be restored completely. These 
results show that the toxins bind in a zinc dependent mode, most likely to the HLA- 
25 DR1 p-chain similar to SEA and SPE-C. However, it does yet not exclude the 
possibility of an additional binding to the HLA-DR1 a-chain. 
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TABLE 3 

BINDING AFFINITIES AND ZINC DEPENDENCIES FOR SUPERANTIGENS TO 

HUMAN CLASS II 



TOXIN 



MHC CLASS H BINDING 

kd [nM] 



ZINC DEPENDENCY 



SEA 


36/1000 




SEB 


340 




TSST 


130 




SPE-C 


70 




SMEZ 


65/1000 


++ 


SME2-2 


25/1000 




SPE-G 


16/1000 


++ 


SPE-H 


37/2000 





The binding affinities of the toxins to MHC class II were determined by Scatchard analysis using LG-2 cells. 
Zinc dependency was determined by binding of recombinant toxins to LG-2 cells in the presence and absence of 
EDTA, as described in the Materials and Methods section. 



The triphasic binding of SEA to HLA-DR1 can be deduced from Scatchard analysis. It 
shows that SEA possesses a high affinity binding site of 36 nM (which is the zinc 
dependent p-chain binding site) and a low affinity binding site of 1 uM (a-chain 
binding site). On the other hand, only one binding site for HLA-DR1 was deduced 
from Scatchard analysis with SEB, TSST and SPE-C, respectively (Table 3). 

Therefore, Scatchard analysis was performed with radiolabeled rSMEZ, rSMEZ-2, 
rSPE-G and rSPE-H using LG-2 cells. All four toxins showed multiphasic curves 
with at least 2 binding sites on LG-2 cells, a high affinity site of 15-65 nM and a low 
affinity site of 1-2 yM (Fig. 10, Table 3). 
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In a further attempt to determine the orientation of the toxins on MHC class II 
competition binding experiments were performed. The recombinant toxins and 
reference toxins (rSEA, rSEB, rSPE-C and rTSST) were radiolabeled and tested with 
excess of unlabeled toxin for binding to LG-2 cells. The results are summarized in 
Fig. 11. Both, rSEA and rSPE-C, inhibited binding of labeled rSMEZ, rSMEZ-2, 
rSPE-G and rSPE-H, respectively. However, rSPE-C only partially inhibited binding 
(50%) of the labeled rSMEZ-2 (Fig. 12). Recombinant SEB did not compete with any 
other toxin, even at the highest concentration tested. Recombinant TSST was only 
slightly competitive against ^-labeled rSMEZ, rSMEZ-2 and rSPE-G, respectively, 
and did not inhibit rSPE-H binding at all. 



Reciprocal competition experiments were performed. Recombinant SMEZ, rSMEZ-2 
and rSPE-H prevented i25i_ r sEA from binding to LG-2 cells. However, only partial 
15 competition (50%) was observed even at the highest toxin concentrations (10,000 
fold molar excess). Recombinant SPE-G did not prevent binding of i25i_ rS EA and 
^si-rTSST binding was only partially inhibited by rSMEZ, rSMEZ-2 and rSPE-H, but 
not by rSPE-G. Significantly, none of the toxins inhibited i2si_ rS EB binding, even at 
the highest concentration tested. 



In a further set of competition binding experiments, rSMEZ, rSMEZ-2, rSPE-G and 
rSPE-H were tested for competition against each other. Both, rSMEZ and rSMEZ-2 
competed equally with each other and also prevented binding of labeled rSPE-G and 
rSPE-H. In contrast, rSPE-G and rSPE-H did not inhibit any other toxin binding 
25 suggesting that these toxins had the most restricted subset of MHC class II 
molecules, which represent specific receptors. 

SECTION B: GENOTYPING 

30 Genotyping of S.pyogenes isolates 

Purified genomic DNA from all Sir. Pyogerte&gotetes was used for PGR with specific 
primers for the smez, spe-g andsp©-figenes as described above and by Proft (1999). 
In addition, a primerpajp^s^cific to a DNA region encoding the 23S rRNA, oligo 
23rRNA forwaKr^ and oligo 23rRNA reverse 

35 (CTGAAAt^ATCTAAGTAGCTG) was designed and used for PCR as a positive control. 
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Southern blot analysis 

About 5ug of genomic DNA was digested using restriction enzyme Hindlll (GIBCO) 
and loaded onto a 0.7% agarose gel. The DNA was transferred from the gel to a 
5 Hybond-N+ nylon membrane (Amersham) as described by Maniatis (1989). A 640 bp 
DNA fragment of the smez-2 gene was radiolabeled using the RadPrime Labeling 
System (GIBCO) and a 32 P-dCTP (NEN). The nylon blots were hybridized with the 
radiolabeled probe in 2x SSC, 0.5% SDS, 5x Denhards overnight at 65°C. After 
washing twice in 0.2x SSC, 0.1% SDS at 65°C the blots were analysed on a Storm 
1 0 Phosphorlmager . 

RESULTS 

PCR based genotyping was performed in order to determine the frequency of the 
15 genes smez, spe-g and spe-h in streptococcal isolates (Table 4). The PCR primers 
for smez were designed to anneal with both genes, semz and smez-2. 103 isolates 
were collected between 1976 and 1998 from varying sites in patients with varying 
infections, although the majority were from sore throats. They comprised 94 group 
A Streptococcus (GAS) and 9 non-GAS, which were S. agalactiae (group B), S. equis 
20 (group C) and Streptococcus spp (group C). There are 25 distinct M/emm types 
represented among the GAS isolates, 13 isolates are M non-typable (MNT) and in 2 
cases the M type is unknown. The analysis was undertaken blinded to the details of 
each isolate and 2 duplicate isolates were included (95/3 1 and 4202) to 
demonstrate the reproducibility of the testing procedure. The isolates are listed in 2 
25 groups. Group 1 contained isolates collected within a large time frame (1976 to 
1996). Group 2 comprised of isolates collected within a short time (1998). 

All of the 9 non-GAS isolates (belonging to groups B, C and G) were negative for the 
tested sag genes. The frequencies for smez, spe-g and spe-h within the GAS isolates 
30 were 95.6%, 100% and 23.9% respectively. A correlation between a certain M/emm 
type and the presence of the spe-h gene could not be established. The deficiency in 
this current set was that only 5M/emm types were represented by more than one 
isolate. The most frequent serotype was M/emm 12 with 13 isolates, from which 7 
were positive and 6 were negative for spe-h suggesting genetic diversity within the 
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M/emml2 strain. In contrast, all 12 tested NZ1437/M89 isolates were negative for 
spe-h. 

The high frequencies of smez and spe-g is of particular interest as this has not been 
5 described for any other streptococcal sag gene thus far. Other spe genes, like speA, 
speC and ssa are found at much lower frequencies and horizontal gene transfer 
might explain the varying frequencies of these genes in different strains. In 
contrast, both smez and spe-g were found in virtually all tested GAS isolates. Only 
4 GAS isolates (11152, 11070, 94/229 and 11610) tested negative for smez. These 
10 were PT2612, emm65, M49 and emm57. Southern hybridisation was performed to 
find out if the negative PCR results were due to lack of the smez gene or to 
03 lack/ alteration of the primer binding site(s). Hindlll digested genomic DNA of 

: : s 

^ selected streptococcal isolates was probed with a 640 bp radiolabeled smz-2 PCR 

fragment (Fig. 13). The smez gene is located on a 1953 bp Hindlll fragment of about 
'ffi t 15 4kb (fragment B), but not to the SMEZ bearing fragment A (lanes 4, 6, 9, 10). In 

^ addition, the smez probe bound to a second DNA fragment of about 4.2 kb (fragment 

iTl C) in isolate 1 1 152 (lane 4). In the Ml reference strain (lane 1) and in isolate 4202 

f\j (lane 8) the smez probe also bound to fragment B, in addition to fragment A. 

Fragment B in the Ml strain contains a 180 bp region that shares 97% sequence 
h h 20 homology with the 3 J end of the smez gene. These results suggest that the 4 PCR 

negative isolates possess a truncated smez gene or a smez-like sequence, but not a 

complete smez gene. 
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Group 1: I 


so late s co J 


lected between 1976 and 1996 


Strain No. 


Group 


M/exnm 


Site 




T?th DMA 


Spe-g 


Spe-h 


Smez 


Vp8 


FP 1943 


A 


M53 


ts 


ST 








1 + 




FP 2658 


A 


M59 


ts 


ST 






- 1 + 




FP 4223 


A 


M80 


ts 


ST 












FP 54 17 


A 


M41 


ts 


0 1 








I ♦ 




FP 5847 


A 


Ml j ts 


0 1 


+ 




- 1 * 




FP 5971 


A 


M57 | ts 


0 1 


+ 




* I * 




1/5045 


A 


M4 j ts 


ST 




+ 






+ 


79/1575 


A 


\A 1 
IVl x 


ts 


Tcarriage 


+ 


+ 


+ 




+ 


81/3033 | A 


M 12 


ts 


ST 


+ 




+ 


+ 




82/20 


A 




sk 


ulcer 


+ 


+ 


- 




+ 


82/532 


f± 


M 12 


ts 


ST 


+ 


+ 






+ 


82/675 


A 




ws 


wound 






- 




+ 


84/ 14 1 


A 


iVl 1 z, 


ts 


ST 












84/1733 


A 




ts 


ST 




+ 






+ 


84/781 


A. 




ts 


ST 


+ 








+ 


85/1 


A 


D/X 1£ 


ts 


ST 


+ 


+ 




+ 


+ 
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A 
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ts 
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+ 


+ 
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j » 1*1 0 /, g 


ws 
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+ 




+ 


+ 
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ws 
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+ 
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A 


n.d. 














- 
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A 


M4 


ts 




+ 


+ 




+ 


+ 
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A 


M12 


ts 


or 


+ 






+ 


■ + 
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A 
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ts 


O I 




+ 




+ 


+ 
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ts 
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O i 


+ 


+ 






+ 
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A 
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sk 


wound 
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+ 




+ 




89/25 
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LIT 






+ 


+ 


+ 


+ 
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A 


Ml 


ts 
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+ 




89/54 


A 


N21437 § 


ts 


ST 




+ 








90/306 


A 


M5 
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otorrhoea 


+ 






+ 


+ 


90/424 


A 


M4 


ts 


ST 


-r 


+ 




+ 


+ 


91/542 


A 
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ts 


ST 




+ 






4- 


94/11 


A 


NZ1437 § 


ps 


abscess 




+ 




+ 




94/229 


A 
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A 


M4 


ts 
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+ 


+ 


94/354 


A 


M12 


ts 


ST 


+ 


+ 




+ 
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+ 
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A 
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ws 
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+ 




+ 


+ 
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be 
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+ 






+ 
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95/31 


1 M 
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+ 




+ 




95/31(2) 
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ws 
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+ 






+ 


+ 
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+ 


- 1 + 
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ts 
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10902 


6264 


A 
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ts 


ST 
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ts 


ST 


+ 
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- 
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+ 




+ 


- 
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+ 
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ts 


ST 


+ 
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B 




ts 


ST 


+ 
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1653 


G 




ts 


ST 
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B 




ts 


ST 


+ 











Table 4: Genotyping of streptococcal isolates. The isolates were collected between 
1976 and 1996 (group 1) and in 1998 (group 2) from patients with varying diseases. 
The results are based on PCR analysis using purified genomic DNA and specific 
5 primers for each of the sag genes. 

The non Gas are: B, 5. agalactiae; C, S. eqias; G, Streptococcus spp. 
MNT, M non rypable: ts, throat site; ws, wound site; sk, skin; ps, pus site; hvs, high 
vaginal site; be, blood culture; ST, sore throat; SF, scarlet fever; RF, rheumatic 
0 fever, AGN, acture glomerulonephritis; T carriage, throat carriage. 
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* and j, duplicate isolates; §, recently assigned as M89; II, recently assigned as 
M92. 

INDUSTRIAL APPLICATION 

5 

The superantigens of the invention, polynucleotides which encode them and 
antibodies which bind them have numerous applications. A number of these are 
discussed above (including Streptococci subtyping, diagnostic applications and 
therapeutic applications) but it will be appreciated that these are but examples. 
10 Other applications will present themselves to those skilled in the art and are in no 
way excluded from the scope of the invention. 

It will also be appreciated that the foregoing examples are illustrations of the 
invention. The invention may be carried out with the numerous variations and 

15 modifications as will be apparent to those skilled in the art. For example, a native 
superantigen may be replaced by a synthetic superantigen with on or more 
deletions, insertions and/ or substitutions relative to the corresponding natural 
superantigen, provided that the superantigen activity is retained. Likewise there 
are many variations in the way in which the invention can be used in other aspects 

20 of it. 
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